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Abstract 

Various wireless sensor network applications involve the computation of a pre-defined function of 
the measurements without the need for reconstructing each individual sensor reading. Widely-considered 
examples of such functions include the arithmetic mean and the maximum value. Standard approaches 
to the computation problem separate computation from communication: quantized sensor readings are 
transmitted interference-free to a fusion center that reconstructs each sensor reading and subsequently 
computes the sought function value. Such separation-based computation schemes are generally highly 
inefficient as a complete reconstruction of individual sensor readings is not necessary for the fusion 
center to compute a function of them. In particular, if the mathematical structure of the wireless channel 
is suitably matched (in some sense) to the function, then channel collisions induced by concurrent 
transmissions of different nodes can be beneficially exploited for computation purposes. Therefore, in this 
paper a practically relevant analog computation scheme is proposed that allows for an efficient estimate of 
linear and nonlinear functions over the wireless multiple-access channel. After analyzing the asymptotic 
properties of the estimation error, numerical simulations are presented to show the potential for huge 
performance gains when compared with time-division multiple-access based computation schemes. 
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I. Introduction 



In contrast to traditional wireless networks, wireless sensor networks are deployed to perform 
various application tasks such as environmental monitoring or disaster alarm. Indeed, rather than 
transmitting and reconstructing the data of each individual sensor node, wireless sensor network 
applications often involve the computation of some pre-defined function of these data (called 
sensor readings), which includes the arithmetic mean, the maximum or minimum value, and 
different polynomials JT). In this paper, we address the problem of computing functions over 
a wireless Multiple- Access Channel (MAC) with a fixed number of sensor nodes and a single 
receiver that is referred to as the fusion center. A standard approach to this computational problem 
widely used in contemporary sensor networks is to let each sensor node transmit separately a 
quantized version of its sensor reading to the fusion center as a stream of information-bearing 
symbols. The data rate at which each sensor node transmits is chosen such that the fusion center 
can reconstruct each (quantized) sensor reading perfectly and subsequently computes the sought 
function. The data transmission and the function computation are therefore completely disjoint 
processes. Moreover, in order to perfectly reconstruct each sensor reading, orthogonal medium 
access protocols such as Time-Division Multiple Access (TDMA) are typically used for the data 
transmission to establish interference-free connections between each sensor node and the fusion 
center by avoiding the interference from other transmissions. 

Separation-based medium access protocols are in general highly suboptimal when for instance 
maximizing computation throughput defined as the rate at which quantized sensor readings are 
reconstructed at the fusion center subject to some communication constraints. In particular, the 
information-theoretic result of suggest that the superposition property of the wireless channel 
can be beneficially exploited if the MAC is matched in some mathematical sense to a function 
being computed. The approach, which is known as Computation over MAC (CoMAC), can be 
seen as a method for merging the processes of data transmission and function computation by 
exploiting channel collisions induced by a concurrent access of different nodes to a common 
channel. An immediate consequence of this approach is a higher computation throughput, and 
with it a reduced latency or lower bandwidth requirements. 

The analysis in [2] also shows that in CoMAC scenarios, codes with a certain algebraic 
structure may outperform random codes. One such an example can be found in j3l (see also 0) 
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where a receiver aims at decoding the parity of two dependent binary messages. The code design 
is in this case driven by an application which is the modulo-two sum computation, and therefore 
the example lifts a strict separation between computation and communication. The research 
on structured codes is however still in its infancy, with some work on codes for computing 
functions that naturally match the mathematical structure of the underlying channel. Note that 
due to the superposition property of the wireless channel, the wireless MAC can be seen as a 
summation-type linear operator mapping the input space to the set of complex- valued numbers. 
Hence functions naturally matched to this channel are linear functions that constitute only one 
class of functions of interest in practice. 

In light of practical constraints, a serious drawback of the information-theoretic approach in [0 
and other related results (see also Section H-Al) is the implicit assumption that if two symbols are 
put on the channel input, then the corresponding decoder observes the sum of these inputs. Obvi- 
ously, this is satisfied in additive white Gaussian channels with users perfectly synchronized on 
the symbol and phase level. In practical wireless sensor networks, however, it may be extremely 
difficult and expensive in terms of resources to ensure such a perfect synchronization. Hence, 
even if structured codes were available, the question remained how to exploit the superposition 
property of the wireless channel in the presence of practical impairments. 

In this paper, we propose and analyze a novel CoMAC scheme for wireless sensor applications 
that requires only a coarse block-synchronization, and therefore it is robust against synchroniza- 
tion errors. It is a a simple analog joint source-channel computation scheme, in which 

1) each sensor node encodes its message (sensor reading) in the power of a series of random 
signal pulses, and 

2) the receiver estimates the function value from the received power. 

Another crucial advantage of the proposed analog computation scheme is its ability to reliably 
and efficiently estimate non-linear functions of sensor readings. We achieve such non-linear 
computational capabilities by letting each sensor node pre-process its sensor readings prior to 
transmission, followed by a receiver-side post-processing of the received signal, which is a noise- 
corrupted weighted sum of the pre-processed sensor readings from different sensor nodes. The 
pre-processing functions and the post-processing function are to be chosen so as to match the 
wireless channel with its superposition property to a function that we intend to evaluate at the 
sensor readings. The weights are due to the impact of the fading channel, which needs to be 
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compensated in practical systems. 
A. Related Work 

In the context of sensor networks, viewed as a collection of distributed computation devices, 
Giridhar and Kumar took the first steps towards a theory-based framework for in-network 
computation with the aim of characterizing efficient application-specific computation strategies 
HI. The work is however focused on complexity and protocol aspects and does not explicitly 
take into account the properties of wireless communication channels. A similar holds true for [|4]|, 
[[H, the information theoretical considerations in @| and Q as well as for [H, which is mainly 
devoted to wired networks. In contrast, harnessing the explicit structure of the channel for reliable 
function computations was first thoroughly analyzed in [|2]| with an emphasis on information 
theoretical insights, whereas computations over noiseless linear channels are considered in 0. 

Function computation in sensor networks is a fundamental building block of gossip and 
consensus algorithms, a form of distributed in-network data processing aiming at achieving some 
network-wide objectives based on local computations. Such algorithms, which compute a global 
function of sensor readings and distribute the function values among the nodes, have attracted 
a great deal of attention (see |[T0l - lfT2l and references therein). Most gossip and consensus 
protocols, however, require interference free transmissions between adjacent nodes, except for 
the recent work in [fT3ll - [fT6ll . where it was shown that the superposition property of the wireless 
channel can be advantageously exploited to accelerate convergence speeds. 

In IfTTTl . an analog joint source-channel communication scheme was proposed to exploit 
the superposition property of the Gaussian MAC for the optimal estimation of some desired 
parameter from a collection of noisy sensor readings. The approach outperforms comparable 
digital approaches based on the standard separation design principle between source and channel 
coding, as proposed by Shannon in his landmark paper lfT8l . Extensions of the analog joint 
source-channel scheme to more general estimation problems in wireless networks can be found 
in [fT9l - ll22l . whereas References Il23ll - ll26ll are devoted to the detection counterparts. 

Finally, we point out that the basic idea of physical-layer network coding is to exploit the 
superposition property of the wireless channel as well. Indeed, in contrast to the traditional 
network coding principle applied across the packets on the network layer, the physical-layer 
network coding generates linear codewords immediately on the wireless channel by superimpos- 
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ing electromagnetic waves from different, concurrently transmitting and perfectly synchronized 
nodes ll27Tl-ll29l. 

B. Paper Organization 

Section HI] introduces the system model, formulates the problem and provides definitions used 
in this paper. In Section (TTTl we present a novel analog CoMAC scheme for estimating linear 
and non-linear functions of sensor readings and study the estimation error under the proposed 
scheme in Section [W] This analysis is used to define appropriate estimators for two function 
examples of great practical importance: the arithmetic mean and the geometric mean. Numerical 
examples in Section IVl illustrate the performance of the proposed CoMAC scheme and compares 
it with a TDMA-based computation scheme to show the potential for huge performance gains 
under different network parameters. Finally, Section [VI] concludes the paperll 

II. Definitions, System Model and Problem Statement 

Throughout the paper, all random elements are defined over an appropriate probability space 
(i?, A, P), with sample space Q, er-Algebra A of subsets of Q and probability measure P on A. 
It is assumed that all functions of random variables and stochastic processes are Borel functions 
to ensure that all resulting random elements are well defined. 

We consider a wireless sensor network consisting of K G IN spatially distributed single- 
antenna sensor nodes and one designated single- antenna Fusion Center (FC). Without loss of 
generality (w.l.o.g.) it is assumed that the K nodes are identical and we use /C := {1, . . . , K} to 
denote the set of all sensor nodes (numbered in an arbitrary order). Basically the sensor nodes 
have the task to jointly observe a certain physical phenomenon (e.g., temperature, pressure, 
humidity, acceleration, illumination) and subsequently transmit their suitably encoded sensor 
readings to the FC. We model the sensor readings as time-discrete A'-valued stochastic processes 

1 Notation: Random variables are denoted with uppercase letters, random vectors by bold uppercase letters, realizations by 
lowercase letters and vector valued realizations by bold lowercase letters, respectively. The sets of natural, nonnegative integer, 
real, nonnegative real, positive real, and complex numbers are denoted by IN, Z+, R, R+, Et++, C. The distributions of normally 
distributed real and proper complex random elements are denoted by A/k(-, •) and A/c(', ■)• ') denotes the log-normal 

distribution and Xn the Chi-square distribution with n degrees of freedom, respectively. The error function and error function 
compliment are described by erf(-) and erfc(-). lg(a;) denotes the indicator function on set B. The imaginary unit is denoted 
by i and hence i 2 = — 1. 
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X k : Q x T — > X, (cj, t) i-> X fc [c<;, i], k G /C, where A? := [x min , x max ] for some given x min < x max 
is the underlying compact state space and T is an at most countable set of increasingly ordered 
real- valued measurement times^ Without loss of generality let us assume that X C S C R, where 
<S := [sinin, s max ], s min < s max , is called the sensing range, which is the hardware-dependent range 
in which the sensor elements are able to quantify values. Finally it is assumed that the joint 
probability density p x ■ X K xf4 R+, px(x;t) := px ly ..,x K (xi, ■ ■ ■ , xk] t) G C (X K ), of 
sensor readings X[t] := . . . , X K [t]) T exists, with C (B), B C R n , being the space of 

real-valued compactly supported continuous functions over B. 

A. Wireless Multiple-Access Channel 

The main contribution of this paper is a novel coding scheme that efficiently utilizes the 
superposition property of the Wireless Multiple-Access Channel (W-MAC) to compute functions 
of sensor readings. The W-MAC is defined as follows. 

Definition 1 (W-MAC): For any transmission time r G Z + , the W-MAC is a map from C K 
into C defined to be 

K 

(W 1 [rl W K [r}) i— ► H k[r]W k [r] + N[r] =: Y[r] . (1) 

fc=i 

Here and hereafter 

• Wk[r] G C, k G /C, is the transmit signal of node k with Vr : |H4[r]| 2 < P max , where 
-Pmax > is the peak power constraint on each node, 

• H k [r], k G /C, is an independent complex- valued flat fading process between the k th sensor 
node and the FC and 

• N[t] is an independent complex- valued receiver noise process. 

Note that Wk[r] depends on the k th sensor reading X k [t] G X at any measurement time t G T. 
If H k [r] = 1 and N[r] = 0, the W-MAC takes the form 

K 

(W 1 [r],...,W K [r])^J2 W ^ ' ( 2 ) 

k=l 

which is referred to as the ideal W-MAC. 

throughout the paper we skip the explicit designation of elementary events u) G Q in the formulation of stochastic processes 
and write for example Xk[t] instead of Xk[w,t\. 
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Remark 1: The W-MAC is a symbol-synchronous channel similar to the standard synchronous 
Code-Division Multiple- Access (CDMA) channel studied for instance in 113011 . OTTl . We would 
like to emphasize, however, that the computation scheme proposed in this paper does not require 
such a synchronous channel and the only reason for assuming perfect synchronization is to 
simplify the error analysis in Section [IV] and the notation throughout the paper. 

B. Pre-processing and Post-processing Functions 

As already mentioned, the objective is not to transmit each sensor reading via the W-MAC but 
rather to compute a function of these readings at the fusion center. Throughout the paper, we use 
/ to denote the function of interest and refer to this function as the desired function. Obviously, 
given a realization of X[t] at measurement time instance t E T, we have / : X K — > R, with 
f(xi[t], . . . ,XK[t]) —'■ (/ ° x)[t) = f(x[t}), where /(cc[t]J is the function value which the FC 
attempts to extract from the corresponding observed receive signal. 

The basic idea behind the scheme for an efficient computation of desired functions proposed 
in this paper is to exploit the broadcast property of the W-MAC to allow the FC to observe 
a superposition of signals transmitted by the sensors. A look at Eqs. (OQ) and © shows that 
the basic mathematical operation which can be naturally performed by the W-MAC on the 
sensor readings is addition. In other words, if all sensors send their readings simultaneously 
over the same frequency band, then the FC would receive a weighted sum of the sensor readings 
corrupted by background noised Now the reader may be inclined to think that such an approach 
is inherently confined for computing affine functions, which in fact is true if no additional 
signal processing is carried out at the transmitters and the receiver. In this paper, in order to 
overcome the restriction to affine functions, we propose to perform some pre-processing and 
post-processing at the sensor nodes and the FC, respectively. To this end, we introduce the 
following two definitions. 

Definition 2 (Pre-processing Function): We define <fk : X — > R, ip^ G Co(X), with ipk(xk[t\) = 
((fk o to be a pre-processing function of node k 6 K. 

Definition 3 (Post-processing Function): The continuous injective function ip : R — > R with 

3 In the case of an ideal W-MAC, the FC would observe the uncorrupted sum of sensor readings. 
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X[t], 





tp(x[t]) 












W-MAC 


► 





/(*[*]) 



Matched Channel 



Fig. 1. Block diagram of the overall channel, which is matched to the desired function. The match results from the transformation 
of the W-MAC by the pre-processing functions (^i(a?i[t]), ■ • • , >Pk(xk [t])) J ='■ ip(x[t]) and the post-processing function tp, 
respectively, which depend on the desired function /. 



tion[] 



^(bHI 2 )' where y[r] given by (OQ) is said to be a post-processing function! 

In order to illustrate the above definitions, it is reasonable to consider an ideal W-MAC, in 
which case the objective of the pre- and post-processing functions is to transform the ideal W- 
MAC in such a way that the resulting overall channel mapping from X K into R is equal to the 
desired function. Therefore, ip kl k G /C, and ij) are to be chosen so that 

f(x 1 ,...,x K ) =i>(^2 keK Vk(xk)j , (3) 
where Wk[r] = ifk{xk) is the transmit signal of node k E K, at time r. 

C. Functions Computable via Wireless Multiple-Access Channels 

Figure Q] illustrates the functional principle of the analog computation scheme proposed in 
Section Unl which is referred to as the CoMAC scheme in what follows. Consequently, the space 
of all functions J r (X K ) C C (X K ) that can be computed using the analog CoMAC scheme under 
the assumption of an ideal W-MAC is given by 

F{X K ) := {/ : X K -> R| f(x) = ^(j2 keK M x k)) } ■ ( 4 ) 

The space of all affine functions is clearly a subset of T[X K \ because any affine function can be 
computed if the pre- and post-processing functions are (fk(x) = v^x, h e K, and ip(y) = ay + b, 
for some (z/ 1; . . . , u K ) 6 and a, b e R. With an appropriate choice of the parameters, we can 
therefore compute any weighted sum and, in particular, the arithmetic mean which is of great 

4 The restriction to a class of post-processing functions that take the squared absolute value of the W-MAC output as an 
argument is necessary because of the analog computation scheme proposed in Section [III] In general, the post-processing 
function can be defined on the set of complex numbers. 



DRAFT 



October 11, 2012 



GOLDENBAUM AND STANCZAK: ROBUST ANALOG FUNCTION COMPUTATION VIA MULTIPLE-ACCESS CHANNELS 9 

interest in practice. Moreover, we can easily determine the number of active nodes in a network 
by letting them simultaneously transmit some constant value c > and then post-process the 
received signal by means of ip(y) = -y. 

Now the following two questions arise immediately: 

i) Is the set of all affine functions a proper subset of F(X K )7 

ii) What is exactly the function space JF{X K ) and how can its elements be computed? 

In other words, the first question is one of whether functions other than affine ones are members 
of J r {X K ) and therefore are computable using a CoMAC scheme? The answer is obviously 
positive as we can easily compute the geometric mean of some positive sensor readings by 
choosing ip k {x) = \og a (x),a > l,x > 0, for each k E /C and ip{y) = a^ y . Indeed, with 
this choice of functions, we have f(x) = 4>(J2k£ic ^(^fc)) = (Ylk=i x k)^ '> where the sensor 
readings are positive so that < x m [ n < Xk for each k E JC. The second question in contrast is 
not so easy to answer. Widely considered in wireless sensor network applications is for instance 
the maximum of sensor readings f(x) = max fce £ x k . It is, however, not clear how to compute the 
maximum function using a CoMAC scheme. On the positive side, the maximum function can be 
arbitrarily closely approximated by a sequence of functions in T{X K \ Indeed, it is well-known 
that lim^oo = f(x) = max keK x k , where = {Y J k=i x l) li e F{X K ),x k >0,ke)C, 
and the norms can be computed when (p k (x) = x q , for all k e /C, and ip(y) = y«. 

Recently, it was shown that in fact for every multivariate function there exist pre- and post- 
processing functions such that they can be represented in the form © |T32)| , ||33l . The main 
difficulty, however, lies in a constructive characterization of J 7 (X K ) to determine the pre- 
and post-processing functions for computing arbitrary members of this space. Since an exact 
constructive characterization of © is beyond the scope of this paper, we devote our attention 
to the problem of computing some functions in a robust and practically relevant manner by 
exploiting the natural computational capabilities of the W-MAC. 

III. Analog Function Computation via Wireless Multiple-Access Channels 

Recent results in sensor network signal processing indicate that for many wireless sensor 
network applications, an analog joint source-channel communication architecture can be superior 
to widely-spread separation-based digital approaches ll34l . This is in particular true when the 
processes of sensing, computation and data transmission are highly interdependent in which 
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case they should be jointly considered. In order to exploit the interdependencies by merging 
the processes of computation and communication, traditional analog coding schemes require a 
receiver-side constructive superposition of the transmit signals from different sensor nodes in the 
sense of (OQ) IfPTI , [|20ll . However, such a perfect synchronization at the symbol and phase level 
is notoriously difficult to realize in wireless networks and in particular in large-scale wireless 
sensor networks ll35l . 

Therefore, in this paper, we propose an analog computation scheme that tolerates a coarse 
block-synchronization at the FC, which is by far easier to establish and maintain than the perfect 
synchronization required by traditional approaches. The basic idea of the scheme consists in 
letting each sensor node transmit a distinct sequence of complex numbers of length M e IN 
at a transmit energy that depends on the pre-processed sensor readings. Under some conditions 
and a suitable pre-processing strategy, the received energy at the FC equals the sum of all 
the transmit energies corrupted by the background noise. The coarse block- synchronization is 
needed to ensure a sufficiently large overlap of different signal frames as illustrated in Fig. 
[21 An application of an appropriately chosen post-processing function at the receiver together 
with some simple arithmetic calculations (to ensure certain estimation properties) yields then an 
estimate of the desired function of the sensor readings. 

A. Computation Transmitter 

1 ) Data Pre-processing: As each pre-processed sensor reading is to be encoded in transmit 
energy only, it is necessary to apply a suitable bijective continuous mapping g v : [<£ mm , ¥w] — > 
[0, -P max ] from the set of all pre-processed sensor readings onto the set of all feasible transmit 
powers, with y? min := mm k£lc inf xeS <p k (x), yw := max keK snp xeS <p k {x) and P max being the 
transmit power constraint on each node (see Definition [[])• Note that the mapping depends on 
the pre-processing functions and the sensing range and is independent of k, as the FC does not 
have access to each individual transmit signal but only to the W-MAC output given by ©. We 
call the quantity 



transmit power of node k, and point out that it is a random variable whenever X k [t] is random. 
Moreover, we have Pk[t] < -Pmax- Thus the information to be conveyed to the FC is encoded in 
P k [t], for all k G K and t e T. 



P k [t] := g v (y k {X k [t])) 



(5) 
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2) Random Sequences: The transmit power modulates a sequence of random symbols. In 
what follows, we use 



to denote a sequence of transmit symbols generated by node k at any measurement time t 6 T. 
The symbols of the sequence are assumed to be of the form Sk[m] = e iek ^- m \ m = 1, . . . , M, 
where {Ok\m}} k m are continuous random phases that are independent identically and uniformly 
distributed on [0, 2ir). This implies || ||| = M and a constant envelope of the transmit signal 
(i.e., IS^m]! 2 = 1, for all m, k), which is a vital practical constraint. We have two remarks. 

Remark 2: Note that the assumption of continuous random phases is not necessary for our 
CoMAC scheme to be implemented. Without loss of performance, the phases can take on values 
on any discrete subset of [0, 2tt) provided that it results in a corresponding set of conjugated 
pairs of transmit symbols. 

Remark 3: Instead of optimizing the sequences assigned to different nodes, employing se- 
quences with random phases and constant envelope reduces the overhead for coordination 
and improves scalability when compared to systems with optimized sequences. Notice that a 
corresponding sequence design will probably be different from that for traditional asynchronous 
CDMA systems OTTl . where the objective is to eliminate or reduce the mutual interference. 
CoMAC schemes in contrast have to exploit the interference for a common goal, which is the 
computation of functions of sensor readings. 

3) Transmitter-side Channel Inversion: If a receiver-side elimination of the impact of the 
fading channel may be infeasible, we suggest that each transmitter corrects this impact by 
inverting its own channel. To this end, channel state information is necessary at each transmitter, 
which can be estimated from a known pilot signal transmitted by the FC. In practical systems, 
the pilot signal can also be used to wake up sensor nodes and initiate the computation process. 
With the channel state information at the nodes, each transmitter, say transmitter k, inverts its 
channel by sending 



where Wjt[m] is the W-MAC input of node k G /C at sequence symbol m (see also (QQ))- In ll36Tl 
it is shown that the division by the channel amplitude |f7fc[m]| is sufficient so that channel phase 
estimation is not necessary. 



S k [t) := (S k [l],...,S k [M]) T eG 



<M 



(6) 




(7) 
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Fig. 2. Transmit sequences of nodes sent between measurement times t and t', respectively, without precise symbol- and 
phase-synchronization. The gray rectangle emphasizes the maximum overlapping area. 



The resulting computation-transmitter structure is depicted in Fig. 3(a) 



Remark 4: Notice that any node k with P fc [i]/|if fc [m]| 2 > P max for some m cannot invert 
its channel under the power constraint, and therefore must be excluded from transmissions 
associated with measurement time t E T. One possibility to mitigate the problem is to scale 
down all transmit powers by the same constant so that the power constraint is satisfied. Of course 
this impacts the performance in noisy channels and requires some degree of coordination. We 
are not going to dwell on this point and assume in the following that the set /C is chosen such 
that each node can invert its own channel without violating the power constraint. 



B. Computation Receiver 

As mentioned before (see Remark [U, in order to avoid cumbersome notation and to simplify 
the error analysis in the next section, we assume a perfect synchronization of signals from 
different nodes at the FC. The reader however may easily verify that the proposed CoMAC 
scheme based on a simple energy estimator is insensitive to the lack of synchronization provided 
that a significant overlap of different signal frames is ensured as illustrated in Fig. [2] (i.e., a 
coarse frame- synchronization). We also point out that the assumption of perfect synchronization 
has been widely used when analyzing asynchronous CDMA systems (see OTTl and references 
therein). 

1 ) Received Signal: With this assumption in hand, the W-MAC is a memoryless channel and 
its output follows with © from ® to (1 < m < M, t E T) 

K 

Y[m] = ^PiMSkH + N[m] . (8) 
fc=i 
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For any given it G T, we arrange the symbols in a vector Y[t] := (Y[l], . . . , Y[M]) T G C M to 
obtain the vector-valued W-MAC 



K 



r W = £ j9 v {MXk[t])) S k [t) + N[t] , (9) 



fc=i 



where N[t] := (iV[l], . . . , N[M]) T e C M denotes a stationary proper complex-valued white 
Gaussian noise process, that is N[t] ~ Ac(0, <j 2 n Im), e (Oj °°)- 

2) Signal Post-processing: The observation vector in ® is a basis for estimating the desired 
function value . . . , Jf^ft]). To this end, the receiver first computes the received sum- 

energy given by 



K K K 

\l 



Mj2 p k[t] + J2Y1 VPk[t]Pt[t]S k [t] H S t [t] 



fc=i fc=i e=i 

t+k 



=:Ai,[t]eB. 



(10) 



+ Re{S k [t] H N[t]}+N[t] H N[t] , 

=-.A 2 [t]en 

which can be expressed in a more compact way as 

K 

\\Y[t}\\l = Mj2Pk[t}+A[t] , (11) 

k=l 

where A[t] := Ai[t] + A 2 [t] + /^[t] G R is the overall noise incorporating the three different 
noise sources. 

Before applying the post-processing function, the receiver must remove the influence of the 
function g^, which is used to map the sensing range on the set of feasible transmit powers. 
In other words, if A[t] = 0, then an application of the post-processing function must perfectly 
reconstruct the sought function value, which is expected from any computation or transmission 
scheme. Now an examination of (ITT]) with © shows that given g v ,ip and k E /C, we need 
to apply a function h v : R — > R to (fTTT) such that 

^(m^M^M)))) = tf(j>* (**[*])) = G ?{X K ) ■ (12) 

keK. keK, 

Thus, given some pre-processing and post-processing functions, we can compute any desired 
function of the form © provided that A[t] = and the pair (g^, h^) satisfies (fT2l) . The following 
proposition provides a necessary and sufficient condition for the functions to fulfill (fT2l) . 
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x k [t] 




W-Wl 




h v 















l/H k [m] 



(a) 



(b) 



^e-i/(^[ 




Fig. 3. (a) Block diagram of the CoMAC computation-transmitter of sensor node k £ K,. (b) Block diagram of the CoMAC 
computation-receiver for computing the arithemtic mean (switch position 1) and the geometric mean (switch position 2). Functions 
hip and %j) depend on the choice of the desired function and should be chosen according to the discussion in Section Hl-CI and 
Definition [5] or Definition [6] For brevity, standard radio components (e.g., modulator, demodulator, filters) are not depicted. 



Proposition 1: Let K > 2 be arbitrary. Then, (fl2l) holds with / defined by © for some 
given ip, (fx, . . . , cpx, if and only if g v and h v are affine functions with h v = g^ 1 — c, where the 
constant c G R depends on g v . 

Proof: The proof is deferred to Appendix |Al ■ 

Examples of the data pre-processing functions and the signal post-processing functions for 
the arithmetic mean and the geometric mean can be found in Section IIV-BI and Section IIV-CI 
respectively. 

C. Performance Metric 

The performance of the CoMAC scheme is determined in terms of the function estimation 
error defined as follows. 

Definition 4 (Function Estimation Error): Let / 6 J r (X K ) be the desired function continu- 
ously extended onto S', where S' C S is an appropriate subset of S J Furthermore, let / be 
a corresponding estimate at the FC, / max := sup xeS ,i< f(x) and / min := mi xeS ,K f(x). Then, 
E := (f(X) — f{X))/(f mm — f min ) is said to be the function estimation error (relative to S'). 

Practical systems tolerate estimation errors provided that they are small enough. This means 
that \E\ < e must be satisfied for some given application-dependent constant e > 0. However, 
in many applications, the requirement cannot be met permanently due to, for instance, some 
random influences. In such cases, the main figure of merit is the outage probability F(\E\ > e), 

5 <S' is introduced since it may be impossible to continuously extend / onto the entire sensing range S. 
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which is the probability that the function estimation error is larger than or equal to e > 0. It is 
clear that the smaller the outage probability, the higher the computation accuracy. 

IV. Error Analysis 

This section is devoted to the performance analysis of the proposed CoMAC scheme in the 
presence of noise. First we show that, for sufficiently large values of M, the distribution of 
the computation noise A[t] can be approximated by a normal distribution. Since the function 
estimation error is strongly influenced by the post-processing function ip, and with it on the 
choice of the desired function /, we confine our attention in subsequent subsections to two 
special cases of great practical importance: arithmetic mean and geometric mean. Note that 
these two functions are canonical representatives of the basic arithmetic operations summation 
and multiplication. For both cases, we define appropriate estimators by taking into account 
statistical properties of the transformed overall noise A[t) (transformed by and ip) and prove 
some properties. Without loss of generality, we focus on an arbitrary but fixed measurement 
time instance t G T and therefore drop the time index for brevity. 



A. Approximation of the Overall Error Distribution 

The statistics of the overall noise in (fTTI) play a key role when defining function estimators 
and evaluating the performance of the proposed CoMAC scheme. Since an exact distribution of 
A = A\ + A 2 + A 3 conditioned on the sensor readings X = x is difficult to determine, we 
focus on suitable asymptotic approximations. 

To this end, let us first compute the first and second order statistical moments of A\, A 2 and 
A 3 . As far as A\ is concerned, we have 

K K M N M , 

Al = EE E y/KPtS;[m]S t [m] =2J2J2V P - COS ( »H) , (13) 

k=l 1=1 m=l n=l m=l * ', ', 

e^k =-Z n [m\ 

where N := K(K- 1)/2, P n := P k P t and 0' n [m] := (0i[m]-O k [m]) mod27r the random phase 
difference between nodes k and i at sequence symbol m. The mapping (k, £) i->- n is obtained 
by n = n(k,£) = £+(k-l)K -k(k + l)/2, k = 1,...,K-1 and I = k + 1, . . ., K, respectively. 
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By convolution of the densities of 6^[m] and 0k[m], 0' n [m] is independent uniformly dis- 
tributed over [0, 27r), for all n, m. Hence, the probability density of each Z n [m] in (fl3T) is^ 

Pz (z) = _-L=l (-i,!)^), (14) 

7TV 1 — Z Z 

which is symmetric around zero. So Vn,m : E{Z n [m]} = and 

N M 

Vpx e C (X K ) : E{Z\ X } = 2^^E{pj}E{Z rt [m]} = . (15) 

n=l m=l 

Furthermore, 

M A 

Var{Z\ 1 } = 4^^E{P rt }Var{Z n [m]} =2M^E{P n } (16) 

n=l m=l n=l 

since Vm,n 7^ n' : Cov{Z n [m], Z n i[m]} = and Vm,n : Var{Z n [m]} = 1/2, where the latter 
can be concluded by considering (fl4l) . As for the second error term A 2 , we have 

K K 2M 

A 2 = 2j2v r Pk~Re{S%N} =2^^v^^iV; (17) 

fc=i fe=i £=1 

where for any odd £, Uke '■= cos(6>fc[m]), N' e := Re{iV[m]} and U^e '■= sm(0 k [m\), N' e := 
Im{iV[m]}, for any even I (m = 1,...,M). Notice that W : N' t ~ A/" R (0, t^a) and the 
probability density function of [/j^ is given by (fl4)) . Because and [7j^ are zero mean and 
independent for all k, £, it follows for the expectation value 

K 2M 

Vpx e C (X K ) : E{A 2 } = 2^^E{v / Pl}E{[/ fc ,}E{iV;} = . (18) 

k=l l=\ 

Arguing along similar lines as in the case of A\, the variance of A 2 can be easily shown to be 

K 2M K 

Var{z4 2 } = 4^^E{P fc }Var{£7 w }Vax{JV;} = 2Mcx^E{P fe } . (19) 

k=l 1=1 k=l 

Since A 3 = J2 m \N[m]\ 2 ~ xlw we Anally conclude E{A 3 } = Ma 2 N and 

\ax{A z } = Ma% . (20) 



Lemma 1: A\, A 2 and A 3 are mutually orthogonal (in the Hilbert space of random variables 
with the inner product defined to be (A,-, Af) = ¥.{AjAj>}) for all px & Cq(X k ). 

6 Note that by the definitions, all the probability density functions and expected values in this section exists. 
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Proof: Since the sensor readings, the sequence symbols and the noise are mutually indepen- 
dent random variables with Vm : E{iV[m]} = 0, a straightforward calculation of the covariances 
between A 1 and A 2 as well as between A 2 and A 3 proves the lemma. ■ 
The above derivations show that \/px £ Cq{X k ) : E{Z\} = Ma 2 N , while, by Lemma [Q the 
variance of A is the sum of the variances ([Tot , (fT9l ) and (|20l) . Thus, 

N K 

a\ := Var{Z\} = 2M ^ E{P n } + 2Ma 2 N E { p k} + Ma% (21) 

n=l k=l 

and note that when conditioned on X = x, the variance in (|2TI) yields 

N K 

a% x := E{(A - Ma 2 N ) 2 \X = x} = 2M^p n + 2Ma 2 N ^p k + Ma% . (22) 

n=l fc=l 

As mentioned in the introduction to this section, we were not able to find out the exact 
distribution of the overall noise A, which includes various terms with different distributions. 
However, since the number of summands J := K[K — l)M/2 + 2KM + 2M in the definition 
of A is already relatively large for small values of K and M, we argue that it is well-founded 
to invoke the central limit theorem so as to approximate the conditional distribution by a normal 
distribution. The following proposition proves the corresponding convergence as M — > oo. 

Proposition 2: Let A\x be the overall noise according to (flOl) and (flTI) conditioned on the 
sensor readings X = x with E{Z\ | X = x} = Ma%, < a 2 N < oo, and cr% x as defined in 
(1221) . Then, for any fixed K, P max < oo and a compact set X, we have 



\/x G X 



K 



A\x-Ma% d 



A/r(0,1) 



(23) 



as M — > oo, where — > denotes the convergence in distribution. 

Proof: Since the sum terms of A\x are neither identically distributed nor independent, the 
convergence to a normal distribution is not clear. Let us therefore rearrange the sum to obtain: 

N M K 2M M 

A\x = A 1 \x + A 2 \x + A 3 = J2J2 V^cos(©;[m]) + 2 ^ ^ y/plU ke N' e + ^ \N[r 



771 



n=l m=l 



k=l 



m=l 



M 

E 



N 



K 



n=l 



J2 \/Kcos(e' n [m\) + J2{ Re { N i m }} cos(0 fe [m]) • • • 

k=l 

+ lm{N[m]}sxn(O k [m]) ) + |A^[m]| 2 



M 



m=l 
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This makes clear that A m , m = 1, . . . , M, are independent and identically distributed nondegen- 
erate (i.e., Varj/li} > 0) random variables. Moreover, for any K, P max , ajj < oo and a compact 
set X, E{vl| | X = x} = 2 fen=i Pn + Ef=i Pk + <4r) i s finite- Hence the proposition 
follows from Theorem 3 in [37, p. 326] with ((221) and E{A | X = x} = Ma%. ■ 
Since Proposition [2] implies the uniform convergence of the sequence of distribution functions 
associated with {Z\|2:}a/ 6 ]n, we can conclude that the distribution of A\x can be approximated 
by a normal distribution provided that M is sufficiently large. This is summarized in a corollary. 

Corollary 1: If M is sufficiently large, A\x is close to A\x ~ J\f^(Ma%, cr 2 A \ x ) in distribution. 
We point out that determining the convergence rate is beyond the scope of this paper, extensive 
numerical experiments (see Section [V) suggest that the approximation stated in Corollary Q] is 
justified already for small values of M and most cases of practical interest. 



B. Arithmetic Mean Analysis 

First, we define a suitable arithmetic mean estimator based on the observation of the channel 
output energy ||Y"||! given by (fTT|) . Subsequently, we analyze the outage probability under the 
proposed estimator. 

Definition 5 (Arithmetic Mean Estimate): Let / be the desired function "arithmetic mean" and 
let the expected value ~K{ip(A 3 / (Ma m n))} be known to the FC. Then, given M, the estimate 
!m(X) of /(X) is defined to be 

f M {X) := -0(^(11^111)) -E{^(4/(aari,M))} . (24) 

Assuming M g v ((p k (x k )) = MJ2 k p k =: z and a^t ■= , ma ^ min , we have 

. Data pre-processing: Wk : (p k (x) = x,g v (x) = a arit (x - s min ), y2 min = s min , y? max = s max , 
• Signal post-processing: ift(x) — x/K, x e R, hAz) = M l z + Ks m \„ . 

Now, we prove two propositions to show that the arithmetic mean estimator of Definition [5] 
provides two most desired properties: unbiasedness and consistency. The resulting computation- 
receiver is depicted in Fig. |3(b)| with the switch in position 1 . 

Proposition 3: The function value estimator of Definition [5] is unbiased, that is, we have 
VxeX K : E{f M (X) \X = x} = f(x). 
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Proof: With the definitions introduced in Section III-CI and Definition [5] in mind, we can 
write (EH) as f M (X) = f(X) + —^(A - Ma 2 N ). From this, it follows that E{f M (X) | X = 
a;} = f(x) + - ^, A/ (E{Z\ | X = x} — Ma 2 N ) = f(x). So the proposition follows since Vx G 
X K : E{A | X = x} = E{A 3 } = Ma 2 N . ■ 
Proposition 4: Let K, P maxi a 2 N < oo be arbitrary but fixed, and let {/a/}a/gm be a sequence 
of estimators (|24l) . Then, the arithmetic mean estimator / of Definition \5\ is consistent, that is 
Ve>0:lim M ^ oo P(|A / -/|>e) = 0. 

Proof: Let c := l^a^K) > and e > be arbitrary and fixed. By the preceding proof, 
we know that E M := f M - f = jfc(A - E{Z\ 3 }). Hence, as E{A} = E{Z\ 3 }, we obtain 

f(\E M \ > e) = ¥(Ejj > e 2 ) = P(c 2 M~ 2 (Z\ - E{A 3 }) 2 > e 2 ) 

< c 2 (Me)- 2 E{(A - E{A}) 2 } = c 2 (Me)- 2 Var{Z\} , (25) 



where we used Markov's inequality B7l p. 47] (also called Chebyshev's inequality). By (1211) . 
we have for K, P max , a% < oo that Var{Z\} e O(M) so that the right-hand side of the above 
inequality goes to zero as M tends to infinity. Since e > is arbitrary, this completes the proof. 

■ 

Since the upper bound in (1251) typically provides rather loose bounds for finite values of M, we 
cannot use it to approximate the outage probability. It turns out that a better approach is to invoke 
Proposition [2] and approximate F(\E\ > e) by using a transformed normal distribution. Note that 

as / m ax = sup^A- f(x) = s max and / min = sup^x f(x) = s mia with / being continuously 
extended onto S, we have 

E\x = (f M (x) - f(x))/(s max - s min ) = (A\x - MvD/a'^ , (26) 

where a' aiit := MKP max . 

The Mann-Wald theorem [1371 p. 356] guarantees that, for any real continuous mapping h = 
h(x), one has h(X n ) A h(X) whenever X n A X. We can therefore conclude from Corollary Q] 
that for sufficiently large values of M, E\x in (|26|) can be approximated by a random variable 

2 

E\x ~ A/" R (0, with conditional distribution function P^{e\x) := F(E < e\X = x) = 
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Since the absolute value is also continuous and F(\E\ > e\X = x) = 1 — P^(e\x) +P§{— e\x) 
for any e > 0, we obtain for sufficiently large M, 



¥(\E\ > e) « ¥(\E\ > e) = ¥(\E\ > e \ X = x)p x (x) dx 

Jx K 

= eric(a' arit e/{2a 2 Alx )^)p x (x)dx , (27) 
Jx K 

where we used the fact that erf (—a;) = — erf(x) for all x E R. 

C. Geometric Mean Analysis 

As in the preceding subsection, we first define an estimator for the desired function geometric 
mean including the required data pre-processing and signal post-processing functions. 

Definition 6 (Geometric Mean Estimate): Let / be the desired function "geometric mean" as 
defined in Section Hl-Cl and let the expected value E{ip(A 3 /a geo )} be known to the FC (see 
Lemma [2] below). Then, given M, the estimate fuiX) of f(X) is defined to be 

f (x , . = HhMYWD) = w Y x iP(A/(a seo M)) 

lM( h E{^(A 3 /(a geo M))} J[ j E{^(^ 3 /(« g eoM))}- 1 ^ 

Assuming M^ 9f) (^(i t )) = MJ2 k Pk =■ * and a geo := loga{Sm ^ loga{s , y we have 

• Data pre-processing: If s min < 0, choose an arbitrary but fixed s' such that < s' < x m ; n < 
s max and otherwise s' = s min . Then, \/k : ip h (x) = log a (a;), a > 1, (p min = log a (V) and 
^max = log a (s max ), and VA; : ^(log a (x)) = a geo (log a (x) - log a (V)). 

• Signal post-processing: ip{x) = a x ' K , x G R, h v (z) = — z + K\og a (s'). 

The resulting computation-receiver is shown in Fig. |3(b)| with the switch in position 2. As 
mentioned, our estimator requires the knowledge of E,{?p(A 3 /(a geo M))}, which is explicitly 
given in part (i) of the following lemma. Part (ii) is used in the proof of Proposition [5J 

Lemma 2: Let a > 1 be given and fixed, and let a geo be as in Definition [6l Suppose that 
a%\og e (a) < a geo KM. Then 
(i) A A/ :=E{^(Z\ 3 /(a geo M))} = 



(ii) lim Am 

M->oo 



<*N lo gc(") 



a ge0 KM-a 2 N log e (a) 



M 



Proof: The proof is deferred to Appendix [Bj ■ 
We point out that the expected value Am exists if cr^log e (a) < a geo KM holds, which is 
usually fulfilled in practical situations and therefore assumed in what follows. With the estimator 
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of Definition [6l the fuction estimation error conditioned on the sensor readings X = x becomes 

E\x = -^E\x - (3(x) = 0{x) - l) , (29) 

where we used the following notation: / max = s max , f min = s' (0 < s' < x min ), f3(x) := 
f(x)/(s max - s'), 7(33) := X M //3(x) and S\x := ip(A\x/(a geo M)). 

Note that the estimator of Definition [6] is not necessarily unbiased but it offers the advantage 
of a simple implementation in practical systems. In contrast, the estimator 

f M (X) = i;(h^\\Y\\l))E{^(A/(a geo M))}- 1 (30) 



is unbiased but not applicable in practice, because in contrast to the expected value in (1281) 
depends E {^(Z\/(a geo M)) } in (1301) on the overall noise, and thus on the distribution of the 
sensor readings, which is usually unknown at the FC. 

Although the estimator is not unbiased, the following proposition shows that it is (weakly) 
consistent, and therefore asymptotically unbiased. 

Proposition 5: For any fixed K, P max , a 2 N < 00, the geometric mean estimator proposed in 
Definition [6] is consistent. 

Proof: Let K, P max , a% < 00 and e > be arbitrary and fixed. Let {/m}msin be the 
sequence of estimators given by (1281) . We show that the outage probability F(\E\ > e) — >■ as 
M -> 00. To this end, consider P(|£|a;| > e) := P(|£| > e\X = x) for any x e X K , and 
note that f(x) > 0, f3(x) > 0, X M > and S x := E\x > 0. By <[29]), we have ¥(\E\x\ > e) = 
F(S X /X M > 1 + e/p{x)) + P(l - S X /X M > e/P(x)). An application of Markov's inequality 
ll37l p. 47] yields an upper bound on the first sum term: 

F(S X /X M > 1 + e/Pix)) = F(log e ( & ) > log e (l + e/ M )) < ^ff^J^ 

(31) 

By (ii) of Lemma H we have lim^oo log e (X M ) = log^limM^oo X M ) = ■ B Y the 

results on the distribution functions of random variables that are functions of other random 
variables OH pp. 239-240], we obtain E{log e (^)} = ]2E jr l ^{^i} = ^S^ ' wnere we 
used E,{A I X = x} = Ma 2 N in the last step. Combining the results shows that the upper bound 
in (13TT) tends to zero as M — > 00. As for P(l — E X /X M > e/(3(x)), note that we can focus 
on e//3(x) < 1 since S x /Xm > 0. With this in hand, we have P(l — S x /Xm > e /fi( x )) — 
m/^x > 1/(1 — e /fi( x )))- Proceeding essentially along the same lines as above shows that 
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this probability goes to zero with M — > oo. Now, by compactness of X and Theorem 3 or 
Theorem 4 of 03 p. 188], we have lim w _ +0O P(|£| > e) = lim Af _ Hx> E{P(|£7 Af | > e\X = 
a;)} = E{lim M _, oo P(|E|>e|X = a;)}->-0. ■ 

Remark 5: Notice that the proposition implies that the proposed geometric mean estimator 
(|28T) is asymptotically unbiased, that is, we have liniM^oo E{/(X) | X = x} = f(x). As a 
consequence, the proposed estimator (1281) is asymptotically equivalent to (1301) . 

Unfortunately, F(\E\ > e) cannot be exactly evaluated because we are not able to determine 
the distribution function of \E\ = ^(X)" 1 ^ — /3(X)\. For this reason, as in the preceding 
subsection, we approximate the distribution of S \ x by a transformed normal distribution since 
in contrast to the arithmetic mean case depends S\x nonlinearly on the conditioned overall noise 
A\x. 

Lemma 3: Let K < oo, X be compact and M sufficiently large. Then, E\x can be ap- 
proximated by a random variable S\x ~ £Af([i~,(j'l\ x ), where fi s = a%\og e (a) / (a ge0 K) and 
°s\x = a A\ x ( lo S c (a)) 2 /(a geo K) 2 , respectively. 

Proof: The proof can be found in Appendix O ■ 

With Lemma [3] in hand, we are now in a position to prove the main result of this section. 

Proposition 6: Consider the proposed geometric mean estimator (|28l) and suppose that E is 
the corresponding function estimation error. Let ji~ and a^ x be given by Lemma [U and let 
f3(x),y(x) > be as defined in (|29l ). Then, for M sufficiently large, the outage probability 
P(|-E| > e), e > 0, can be approximated by 



\E\ > e) 



\E\ > e) 



X K 



P(|S| > e\X = x)px(x) dx 



(32) 



with 



\E\ > e\X = x) 



2 -f erf ( logc( - p ( x > e ))-vs \ _ er £ ( iog c (p + Q,Q)-Mg 

1 or f „ ( log (p + (a;,€))-fe 

2 erIC I V2a s]x 



< e < 0(x) 



/3(x) < e < oo 
(33) 

where p~(x,e) := y(x)(/3(x) — e) and p + (x,e) := y(x)(/3(x) + e), respectively. 

Proof: The proof is deferred to Appendix [D]. ■ 
In Section IV-Al we choose a particular density px( x ) and evaluate (|32|) numerically to indicate 
the accuracy of the approximation for different network parameters. 
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V. Numerical Examples 

The objective of this section is twofold. First, we show in Section IV-AI that the approximations 
of Section [IV] are very accurate, and second, we compare in Section IV-BI the proposed analog 
CoMAC scheme with a TDMA-based scheme to indicate the huge potential for performance 
gains in typical sensor network operating points. 

As a basis, we consider a classical environmental monitoring scenario in which the FC is 
interested in the arithmetic mean or geometric mean of temperature measurements taken by 
a number of sensor nodes distributed over some geographical area. We assume that all nodes 
are equipped with a low-power temperature sensor supporting a typical sensing range S = 
[-55 °C, 130 °C] 11381. 



A. Approximation Accuracy 

To assess the accuracy of the approximated distributions, we consider two scenarios: one 
where the FC estimates the arithmetic mean, and one where the geometric mean is desired. We 
compare (|27l) and (l32l) with Monte Carlo evaluations of the outage probability F(\E\ > e) based 
on 10 • 10 3 realizations. Note that for both simulation examples, P max and a 2 N have been chosen 
in agreement with commercial IEEE 802.15.4 compliant sensor platforms ll39l . 

Example 1 (Arithmetic Mean): Let M = 25, 50, 150, 250, the number of nodes K = M and 
the sensor readings uniformly and i.i.d. in X = [1 °C, 30 °C] C S. The resulting experimental 



data is depicted in Fig. |4(a) 



The plots in Fig. |4(a)| indicate that expression (|27l) accurately approximates the true outage 
probability ¥(\E\ > e) for all e > 0, since already for relatively short sequence lengths, 
differences between the analytical expression and the Monte Carlo simulations are negligible. 
Furthermore, the plots numerically confirm the consistency statement of Proposition 0] because 
the probability curves tend to the ordinate axis with growing M. 

Example 2 (Geometric Mean): Let S' := [s f , s max ] = [0.5 °C, 130 C1 C S, a = 2, X = 
[1 °C, 30 °C] C S' and all other simulation parameters as in Example \T\j The resulting experi- 
mental data is depicted in Fig. |4(b)[ 

7 Notice that the corresponding function estimation error relies on S' since desired function geometric mean can not be 
continuously extended onto the entire sensing range S. 



October 11, 2012 



DRAFT 



24 



SUBMITTED TO IEEE TRANSACTIONS ON COMMUNICATIONS 




Similar as for Example [H the plots in Fig. |4(b)| show that (|32l) with (|33l) approximates the true 
outage probability sufficiently accurate, with a negligible deviation for short sequence lengths. 
In Section IIV-CI we mentioned that although the geometric mean estimator (1281) is applicable in 
practice, it has the drawback of only an asymptotic unbiasedness compared to the impractical 
unbiased estimator (1301) . Nevertheless, besides a comparison of a Monte Carlo evaluation of 
W(\E\ > e) using (|28|) with the analytical result (|33l) . the figure also contains a plot in which 
(1301) was used to quantify the drawback. The difference between (1281) and (|30l) vanishes quickly 
with increasing M , which confirms Proposition [5] and Remark [51 

Remark 6: Propositions @] and [5j as well as Examples Q] and [2] demonstrate that the sequence 
length M is the crucial design parameter, which determines the trade-off between computation 
accuracy and computation throughput. 

B. Comparisons with TDMA 

The numerical examples in the preceding subsection indicate the general behavior of the 
proposed analog computation architecture without concrete evidence regarding the computation 
performance compared to standard multiple-access schemes. Therefore, we demonstrate in this 
subsection the superiority of the proposed CoMAC architecture by a comparison with an ideal- 
ized uncoded TDMA scheme. For TDMA, the individual nodes quantize their sensor readings 
uniformly over S with Q 6 IN bit, followed by binary phase shift keying, such that each sensor 
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has to transmit a bit stream of length Q to the FC. 

To ensure fairness between CoMAC and TDMA, with fixed degrees of freedom (e.g., band- 
width, symbol duration), both schemes should induce the same costs per function value computa- 
tion with respect to transmit energy and transmit time. Therefore, let T G R++ be the common 
symbol duration and let Ptdma./c G denote the instantaneous TDMA transmit power on 
node k G K. Then, the transmit times per function value are T CoMAC = MT and T TDMA = QKT, 
whereas the transmit energies can be written as E Co MAC,fc = MP k T and E TDM A,fc = QPjvma^T, 
respectively. Now, from the fairness conditions T Co mac = Ptdma and E CoMACj a: = E TDMA)fe , for all 
k G /C, it follows M = QK for the CoMAC sequence length and P TD MA,fc = ^ = 9v(ipki * k))M , 
k G /C, for the required instantaneous TDMA transmit powers. 

In addition to fairness, requires an adequate comparison the determination of a common system 
operating point, which can be done in terms of an average Signal-to-Noise Ratio (SNR). Assume 
for simplicity that the sensed values X k are i.i.d. in X, for all k, such that the average received 
TDMA-SNR per node can be defined as 

SNR; ™i , (34) 

which depends on the desired function. 

Example 3 (Small Network Size): Let K = 25, Q = 10 bit, the sequence length M = QK, 
and let P max and a% be chosen such that SNRf := 10 log 10 (SNR / ) G {0, 2, 4, 6, 8, 10}. Further- 
more, let the sensor readings be uniformly and i.i.d. in X = [5 °C, 30 °C] C S and let the desired 



function be "arithmetic mean". The corresponding simulation data is depicted in Fig. |5(a) 

Example 4 (Medium Network Size): Let K = 250, the desired function be "geometric mean" 
with S' = [1 °C, 130 °C] C S, a = 2 and let all other simulation parameters as in Example HI 
The corresponding simulation data is shown in Fig. |5(b)[ 

Figures |5(a)| and |5(b)| indicate the huge potential of the proposed analog CoMAC scheme for 
efficiently computing linear and nonlinear functions over the wireless channel. In both examples, 
CoMAC entirely outperforms TDMA with respect to the computation accuracy for different 
network parameters. It should be clear that the shown performance gains can be replaced by a 
computation throughput gain. 

Remark 7: It is important to emphasize that the shown performance gains are quite conser- 
vative since the simulated TDMA scheme was idealized in many ways. For example, a realistic 
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Fig. 5. CoMAC vs. TDMA: outage probabilities for quantization with Q = 10 bit (in the case of TDMA), sequence length 
M = QK, and SNRf = 0, 2, 4, 6, 8, 10 dB. 



TDMA would require an established protocol stack with considerable amount of overhead 
per frame (e.g., header, synchronization information, check sum) such that the overall TDMA 
transmission time would extend to T TDMA = (Q + R)KT with a certain R e IN. 

VI. Conclusion 

In this paper, we proposed a simple analog scheme for efficiently computing functions of 
the measurements in wireless sensor networks. The main idea of the approach is to exploit the 
natural superposition property of the wireless channel by letting nodes transmit simultaneously 
to a fusion center. Applying an appropriate pre-processing function to each sensor reading prior 
to transmission and a post-processing function to the signal received by the fusion center, which 
is the superposition of the signals transmitted by the individual nodes, the approach allows the 
analog computation of a huge set of linear and nonlinear functions over the channel. To relax 
corresponding synchronization requirements, the nodes transmit some random sequences at a 
transmit power that is proportional to the respective pre-processed sensor information. As a 
consequence, only a coarse frame synchronization is required such that the scheme is robust 
against synchronization errors on the symbol and phase level. The second essential part of the 
scheme consists of an analog computation-receiver that is designed to appropriately estimate 
desired function values from the post-processed received sum of transmit energies. Since the 
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estimator has to be matched to the desired function, we considered two canonical function 
examples and proposed corresponding estimators with good statistical properties. 

Numerical comparisons with a standard TDMA have shown that the proposed analog computa- 
tion scheme has the potential to achieve huge performance gains in terms of computation accuracy 
or computation throughput. In addition to the weaker requirements regarding the synchronization 
of sequences, the scheme needs no explicit protocol structure, which significantly reduces the 
overhead. Computation schemes following the described design rule are therefore energy and 
complexity efficient and can be easily implemented in practice. Finally, the hardware-effort is 
reduced as well since energy consuming digital components (e.g., analog-to-digital converters, 
registers) are not required. Note that the proposed computation scheme can be used as a building 
block for more complex in-network processing tasks. 

Appendix 

A. Proof of Proposition [7] 

Let g' v := Mg^ and h'^ := l/Mh^ so that we have to show that ^(Efcfl^fc)) = EfeCfc wim 
£ [^min, <Pmat\>k £ holds if and only if g v and h v are affine functions. The "<=" direction is 
trivial, while the other direction is shown by contradiction. Suppose g' is bijective and continuous 
but not affine. Then there exist two points (£1, . . . , and (£1, . . . , m [^mm, ^Pmax\ K with 
Efc^fc ^ E fc 4 but Efc^fe) = E*0j>(£*)- By the last equation, we have 

E*& = ^(E* 5^(60) = ^(Efe^(l&)) = Efclfc > 

which however contradicts Efc'Cfe 7^ Efc£fc- Hence, g' is affine and so is g v . Moreover, we have 
^v(Efc#L(£fc)) = h<p(Mg<p(52 k £,k) + c) for some c G R, from which we conclude that h v is an 
affine function as well with h v = g^ 1 — c and some constant c G R that depends on g^. 

5. Proof of Lemma [2] 

Since Z\ 3 ~ xLr» the probability density of Z\ 3 is ^ 3 (x) = ^ ^ (M) x A/ ~ 1 l [0)Oo) (x), 
where -T(z) with Re{z} > is used to denote the Gamma function. Hence, one obtains 

J 

Now assume a% log e (a) < a geo KM and note that r{z) = J °° x z ~ 1 e~ x dx = k z J °° x z ~ l e~ kx dx, 
Re{z} > 0, which holds for any Re{£;} > 001. So substituting this into ([35]) with an 
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appropriately chosen k proves (i). As for (ii), if a 2 N log e (a) < a geo KM, then it follows from (i) 



M 2 i / \ 



that lim M ^oo - r- „ = liniM^oofl + £) M = e «, where u : 



a ge „i : s:M-^log e (a)y JH-+tX5V^ ' M J » yy^~ u, . . 

C. Proof of Lemma \3\ 

Let be an arbitrary compact set and < oo any fixed natural number. By Section 
IlLCl and Definition we know that S\x = ^(^^) = a^i^ 17 ^, if, a geo > 0, a > 1. 
Since ^ is continuous and strictly increasing, Ps(£\x) = F(E < £\X = x) = P(Z\ < 
a ge oi^Mlog a (^)|X — x) — PA(a geo K M \og a (£)\x), £ > 0. Thus, bearing in mind Corollary [Q 
we can conclude that, as M sufficiently large, A\x can be approximated by a random variable 
A\x ~ J\f^(Ma%, a^ x ). An immediate consequence of this is that for sufficiently large values 
of M, the distribution function of A\x can be approximated by P^(8\x) = \ + | erf ( & ^J N ) 
(i.e., P^(5|a;) ~ P^(5|aj)). Moreover, for M large enough, the Mann-Wald theorem 071 p. 356] 
implies P s {i\x) « Fg^lx) = P^(« geo KMlog a (0l^), where (f G R++) 

P / ™, / t M \ 11 f f Q ge o^Mlog c (Q - a|Mlog^a) A 

P^(a geo KMlog a (0l^) = - + - erf — . (36) 

I I \ V2 loge^ajcr^^ / 



Note that (1361) describes the distribution function of a log-normally distributed random variable 
with parameters aN ^° Sc ^ —■ /J>s and ( ^^ cr^x) 2 =: ^siar Thus S\x is approximated by a 
random variable Hlac ~ CM(fis,a%, x ). 

D. Proof of Proposition |6| 

Note that it is sufficient to show (|33l) . Because \E\x\ = ^(x) -1 S\x — f3(x)\ is continuous in 
E\x, Lemma [3] and the Mann-Wald theorem allow for the approximation of \S\x\ by \E\x\ = 
|7(cc)~ 1 S , |cc — P(x)\, where the probability distribution function of S\x ~ CAf(fJ,s,cr'w\ x ) is 
given by £36]). Since < /3(x),~/(x) < oo, we have P(|P| > e\X = x) » P(|P| > e|X = aj) = 
1 - P(-e < E < e\X = x) = 1 - P(-e < 7 (z;)- 1 H - (3{x) < e\X = x) which leads to 

Jl-P^p+^eJI^+PsCp-^e)!*), 0<e</3( a; ) 
P(|P| > e|A = x) = < (37) 

I l-P^(p + (aj,e)|aj), /3(cc) < e < oo 

where p + (x,e) := 7(0;) (/3(a:) + e) and p~(a:,e) := 7(0;) (/3 (a;) — e). Inserting the right-hand side 
of ((361) into expression (|37|) and using erfc(x) = 1 — erf(x), for all x G R, shows (1331) and thus 
completes the proof. 
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